User Behaviors Lend a Helping Hand: Learning Paraphrase Query Patterns from Search Log Sessions
نویسندگان
چکیده
Search log sessions contain a large number of paraphrases contributed by users during query rewriting. However, it is a big challenge to distinguish paraphrases from the simply related queries in the sessions. This paper addresses this problem by making innovative use of user behavior information embodied in query sessions. Specifically, we learn paraphrase patterns from the search log sessions with a classification framework, in which three types of user behavior features are exploited besides the conventional features. We evaluate the method using a query log of a commercial search engine. Experimental results demonstrate the effectiveness of our method, especially the significant contribution of the user behavior features. We extract over 250,000 pairs of paraphrase patterns from the used search log, with a precision over 76%. TITLE AND ABSTRACT IN ANOTHER LANGUAGE (CHINESE) Äu^r1A l|¢¬{¥ ÷Eã Î |¢Ú^r3|¢2~¬é Î?1ÓÂU §±1⁄4 Ð |¢(J§Ïd |¢F ά{¥1þ Eã] ", (J ̄K ́XÛ«© ά{¥ = Î ́Eã'X§= K== ́Â' ®" ©ÏLÐ/|^ ά{¥Û1 ^r1A 5)ûù ̄K"äN/§·Äu©a {l ά{] ¥ ÷E ã Î "3©a .¥§·Ø=¦^ DÚ Eã£OA §}Á n«^r1 A "·¦^û|¢Ú ^r|¢F5μ ©JÑ {"μ (Jy 2 ©{ k 5§cÙ ́^r1A å 2w^"·l¤¦^ ^rF ¥ Ä Ñ L250,000é Eã §ÙO(ÇL76%"
منابع مشابه
Analysis of User query refinement behavior based on semantic features: user log analysis of Ganj database (IranDoc)
Background and Aim: Information systems cannot be well designed or developed without a clear understanding of needs of users, manner of their information seeking and evaluating. This research has been designed to analyze the Ganj (Iranian research institute of science and technology database) users’ query refinement behaviors via log analysis. Methods: The method of this research is log anal...
متن کاملDiscovering Popular Clicks\' Pattern of Teen Users for Query Recommendation
Search engines are still the most important gates for information search in internet. In this regard, providing the best response in the shortest time possible to the user's request is still desired. Normally, search engines are designed for adults and few policies have been employed considering teen users. Teen users are more biased in clicking the results list than are adult users. This leads...
متن کاملParaphrasing with Search Engine Query Logs
This paper proposes a method that extracts paraphrases from search engine query logs. The method first extracts paraphrase query-title pairs based on an assumption that a search query and its corresponding clicked document titles may mean the same thing. It then extracts paraphrase query-query and title-title pairs from the query-title paraphrases with a pivot approach. Paraphrases extracted in...
متن کاملDetecting User Sessions in the Tumba! Query Log
This paper describes an approach to detect distinct user sessions from the logs of a particular search engine. We present our work by describing the proposed algorithm and some interesting usage patterns that were detected. Some pitfalls of our approach are also noted. Finally, we give some insights on how web log mining could be exploited in other areas such as semantic relation extraction or ...
متن کاملRRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features
Principal aim of a search engine is to provide the sorted results according to user’s requirements. To achieve this aim, it employs ranking methods to rank the web documents based on their significance and relevance to user query. The novelty of this paper is to provide user feedback-based ranking algorithm using reinforcement learning. The proposed algorithm is called RRLUFF, in which the rank...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012